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© Digital high speed algorithmic data recovery method and apparatus. 



© An all digital data algorithmic recovery method and apparatus which operates at jitter greater than 25% and 
where run length is more than 1000 bits and which uses self calibrated delay elements to phase align a locally 
generated time ruler reference with the data average transition position to reliably establish the sampling time for 
retrieving data from an incoming binary sequence at the center of the data eye. The phase adjusted time ruler 
signal is used to sample transition positions of the data and the sampled data is statistically analyzed in a state 
machine wherein the time ruler is a broadband signal comprising a first and second base frequency and wherein 
the period of one of said frequencies is 



where F R equals the receiver local clock frequency and F T equals the frequency of the distant clock. 
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This invention relates to digital algorithmic data recovery for serial data transmit/receive svstems and 

" SSnce is m£e' T°* FT" *" "»* ^ ^^^^ 
which ZIT / COpend,n 9 Eur °P ean P ate * applications being filed concurrently herewith and 

wh.ch relate to aspects of our high data rate digital data/clock recovery system The applications 
; US applications: (1) "Digits. Full Range Rotating pZse'sZJ seTn^lr 
. Z u r L 24 1 " 3; inV6nt0r Bin GU ° ; AMD Docket Number ™94; and (2) "Digital Zabt 

AMD Doc^^ S6rial ° 8/021 - 712: «*» ** **■ ^ 1«* Lntor Bin^ 

w and 9330407°q ? wSr^ ? 10 ° Ur Copendin 9 Eur °P ean Patent applications 93304078.4, 93304077.6 

In sell JL t f 6 3SPeCtS ° f ^ d3ta r6C0Very SyStem described in the Wnt application 
In senal data fransm.Wece.ve systems, the clock information which provides the correct timing for data 

lnT7n 7 h 9enera " y en l bedded in the data ^rearn and there is no separate line or channel to Tarry a dock 
signal to the receiver. Clock frequency recovery and phase alignment with data is gene ally pLo'med 
befo e the rece.ved data can be recovered and de-serialized. Traditionally a Phase Locked Loo^ i, 

" tZTVl T* freqU6nCy ° f 3 V0 ' ta9e C0ntro " ed 0scil,at - < V C°) * ad'stec ^rSponsivfto he 
.ncommg data frequency. The phase acquisition can be accomplished by the same phas HSSToS, 

St" k P ^ '° Cked ' 00P dePendin9 ° n Whether a sin 9' e or a m "'«-'o°P system's used The PLL 

,h P ?cnL P H 3 f ^7 ° f C, ° Ck ' ° r the phase diffe ™ ce between the output of the VCO and 

the .ncom.ng data and generates an error signal which is filtered and converted to a confrol vo tage to drive 

onslnt he' VCO^l Pha 7 T^T TT ^ *"* ° f » e tW ° ' S e ' iminated "^ITa 

constant, the VCO output .s truly a ret.med clock since it has an established, known phase in relation to the 

data. In pract.ce, the data is often contaminated with various types of noises such that tinL or chase Mer 

rlZVT^ f.? e Vf' tranSiti0nS ' " the data Stream d0 not ^ arrive peS.y in" t^e bu 
rather at d.fferent "early- or ".ate" times, causing timing noise known as phase or timing jftter ncorrec 

fr^ e nr r0rS t H re eteCted and adjustments t0 tne VCO control are still attempted even !f the VCO 
frequency is the same or very close to the data frequency. The PLL is designed to reduce the effect of 

ecovS cTock is" LE^S * '° W P3SS ^ Such th^Z^ 

recovered clock is l.m.ted and good stab.l.ty of the loop achieved. However, heretofore this has been 

so oenTaS TT -, I ° ther prob,ems - Th * control voltage to the VCO is suscep^.e to nt erna"y 

m^TSt-ZTT" 0 T ° h iS m ° re S6riOUS Whe " the ° peratin 9 *W<* increase.. The low pass 
Mter funct.on often employs components such as large values capacitors * and resistors. These and Cher 
actors have made the monolithic integrated circuit implementation costly, especially when integrated S 
large scale, dense digital functions. Therefore, an all digital solution is highly £^b^S?T^ 
d.mens.ons of .ntegrated circuits becomes smaller such that a complicated logic tu£!£^JZ£ZZ 
35 d.e area than even a single capacitor component. V 

adiust ITriZT T di9 f ital T ^ reC0VSry approaches ' an ad i us * a b'e bias voltage or current is employed to 
adjust he delay value of a delay un.t in a ring oscillator to achieve frequency tuning or to adiust ohlll to 
match that of the incoming data. In these and other prior approaches, the techniques are glerS specified 

- w^JS^zzv™* jitt r t0,erance or operation frequency in ° ur ^-^S 

™ 93304078.4 data recovery ,s based on a discrete delay time ruler used for bit interval 
measurement. The t.me ruler in this ear.ier invention is generated from a local frequency reference source 

tn hm meas " rem ents .s m,t.ated upon the arrival of a data transition. Since the data are assumed 

Z^^ZZT°*! >0B m l ° f +/ " 25% ° f b " Peri0d ' the jitt6r t0 ' erance for -ch a s^Z-' 
45 educed to the same range. In our aforesaid pending application 93304077.6 jitter contributed from DCD 

STof an DCD co'moeltTI ^ U> ' n ° " 8,1 di9ita ' a ' 9 ° rithmic -ethod Pelk To-peak 

hi P c j om P ensated system w.th this approach may be reduced to less than + /-25% even when 

he ongma. unadjusted peak-to-peak jitter is larger than + /-25%. However, the DCD compensation 

techmque has a m.nimum improvement on jitter reduction if the DCD jitter is not dominant "EoTS 

ru U : P e r n e q;°h U NRTdl er s f 13 SCh6me iS ^ * * t0 Sh ° rt ™ da a ^ Tong 

Tentual ca usl T l\ ' (COn f Se . CUt,ve data bits withou * tranrttlon). the accumulated timing error wi?i 
!Zlnt V , 6 S ' nCe 3 f ' n,te 6rr0r 0f the time ruler (° r the de, ay value of the time ruler delay 

element) .s always present; in other words, the jitter tolerance of my earlier system diminishes as the dS 

, s h'^T 5 • SinC6 thS d6Si9n ° f S0me codin 9 systems limit " he run length ofTe encoded data fe 

Channel ^V" ^ <** < FDD '>' and the 8B ^B <^uil?Tro^ 

^SLZ^ ^f 6 " ^ US6fU '- H0W9Ver ' ° ther C ° din9 Systems such as ^ systems u ing 
scrambled data, may not guarantee the run length limit. A need exists to have a aeneral J\\ riinit*. Km- 

ruler based data recovery scheme which has much larger jitter to.erance than + S^of the f ofttlS! 
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and which does not deteriorate for longer run length, irrespective of the data or coding format employed. 

In the present specification we describe a general, all digital data recovery method and scheme, which 
retains its benefits without requiring any specific data or coding format, and which can be implemented and 
operated for high speed data transfer. 
5 We describe a digital scheme for data recovery that detects and locates the average transition position 

of an incoming serial data in respect to a locally generated time ruler reference to establish a correct phase 
refalionship between the incoming data and the local time reference. 

Time-calibrated delay elements, or the time ruler based reference, are used instead of the analog, PLL 
based systems for data recovery. 
10 We describe a method to phase align the locally generated time ruler reference with the data average 
transition position such that the time ruler measurement is initiated at the most appropriate position to 
obtain maximum jitter tolerance. 

We describe a combined, so called "integrated time ruler" which is a signal combination of a local 
reference clock signal from a local frequency source and local time rulers, where the time rulers are 
75 generated from time-regulated delay elements, which technique greatly reduces the accumulated error of 
the time ruler measurement such that the run length of the data can be extended to more than 1000 bits. 

We describe a complete data recovery scheme which is suitable for digital integrated circuit im- 
plementation. 

A signal representing a "compensated broadband time ruler", is phase-aligned with the detected and 

20 averaged data transition position. The compensated broadband time ruler is used as the recovered timing 
signal for reading and retrieving the data from an incoming serial binary sequence. Delay elements for 
generating the time ruler can be the same as those described in our aforesaid pending application, 
93304079.2. The incoming data could be preconditioned by passage through a digital jitter correction and 
signal pre-conditioner as described in our aforesaid pending application 93304077.6. The data could be 

25 sampled while traveling through a delay line comprising two time ruler elements. Each time ruler element 
has a nominal delay value equal to one half of the data bit period. The signal representing the integrated 
time ruler is first passed through a digital up/down phase shifter. The phase adjusted time ruler signal is 
then used to sample or register each data edge or transition position into a set of registers. If the signal of 
the integrated time ruler is properly aligned with the data, the timing edge of the time ruler signal should be 

30 aligned with the position known as the center of the "data eye". The center of the edge distribution of the 
average transition position is one-half bit interval from the eye center. If, however, the two are not aligned 
properly, a different set of registered data would be presented in the register sets. By observing and 
analyzing the sampled data in the register sets, information regarding the phase relation between the 
compensated integrated time ruler signal and the incoming data can be acquired. An up/down signal is then 

35 generated in a Phase Adjust Decision block to inform a rotating phase shifter to make adjustment such that 
the phase difference is reduced. This process continues until the phase relation of the two reaches a pre- 
specified condition. 

By combining the reference frequency and a time ruler signal which is real time calibrated to be a 
multiple of the reference frequency to form the compensated broadband time ruler we achieve the ability to 

40 remain properly frequency locked even for large extended run length on the incoming data. The signal 
representing the compensated broadband time ruler is not a regular clock signal since it is not a single 
frequency periodic signal, but rather a signal with two base frequencies: the reference clock frequency, and 
the frequency derived from the reference frequency using the delay time ruler technique. To be able to 
correct for a extended run length, the frequency, or the bit period accuracy, becomes crucial. In the time 

45 ruler data recovery method described in our aforesaid co-pending application 93304078.4 the time lapse of 
the "period" of the time ruler is determined by the regulated delay elements. There is always a finite time 
error in such delay elements, depending also on the achievable delay adjustment resolution. For the same 
achievable adjustment resolution, the percentile error becomes larger as data rate increases and bit period 
decreases. The accumulated error can become very significant if time rulers with the same time error are 

so simply "cascaded". The absolute as well as the relative error grows as the run length grows. One corrective 
method for such error would be to "compensate" the error periodically, before it becomes too big. This, of 
course, requires another time ruler or time reference with much higher timing accuracy. In fact, a local 
frequency reference source, which may run only at one Nth of the data frequency, can have much higher 
accuracy. Many systems including the FDDI standards specify that the local frequency source at the 

55 receiver end is allowed to have maximum frequency offset + /-50 ppm (parts per million) with respect to 
the frequency source on the transmitter end (or the incoming data frequency). For +/-50 ppm accuracy 
(which is not difficult to achieve with modern crystal-oscillator technology), less than +/-1%timing error 
would occur for the accumulation of 200 cycles of the reference clock, or 200 x N bit periods. The 

3 
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accumulated time error from a cascaded time ruler for so many cycles would be much larger. Our solution 
is to use the local reference clock signal (known as the parallel data clock or byte clock) to correct the 
delay generated time ruler. Assume that the reference dock signal is 1/N times the data rate, so that N time 
ruler periods at the incoming data frequency are generated within a reference clock (byte clock) period and 
s assume that each time ruler period has a timing error of + Te, then at the end of the Nth time ruler period 
h , JT T° r +(N X Te) iS accumulated - However, if the last time ruler period in each byte is 

^dhL t- a ,™ Unt ° f " (N ^ * he time err ° r iS fU " y C0 "Wted. By using our "Compensated 
broadband tame ruler" concept, the data run length is allowed to exceed thousands of bits without causing 
an error in recovered data. a 
io In the accompanying drawings, by way of example only: 

Fig. la is a functional diagram of a transceiver providing an all digital data recovery system employing a 
locally generated "Compensated broadband time ruler" and algorithmic data transition averaging 
Fig. 1b is the block diagram of an alternative embodiment of the system of Fig. 1a. which employs a 
preconditioning block. r^y* a 

Fig. 2a is the block diagram of the Compensated Broadband Time ruler Generator and Regulator 
Fig. 2b is the timing diagram for Fig. 1a and Fig. 1b. 
Fig. 2c is the block diagram of a DCD Preconditioner. 

Fig. 3 is the block and logic diagram of one embodiment of the edge distribution sampler block of 

figures i 3 and 1 b. 

Fig. 4a and 4b show timing diagrams and analysis 'used in the phase adjustment decision function 
Fig. 5a, 5b, 5c through 8a, 8b and 8c are illustrative diagrams analyzing the l-O test method 
Fig. 9a through 9e are illustrative summary diagrams analyzing the distribution test method for different 
data transition distributions. 

Fig. 10 is a chart showing the truth table combinations for generating the up/down commands in the 

25 phase adjust decision block. 

Fig. 11 is the block diagram of an embodiment of an alternative edge distribution sampler 

Fig. 12 is the schematic for the Bit Serializer of Fig. 2a. 

Fig. 13a is a block diagram for a 1/2 Tb delay element of Fig. 2a. 

Fig. 13b is a schematic for the delay cell of the 1/2 Tb delay. 
30 Fig. 14a is a block diagram of the Sampling Register of Fig. 2a. 

Fig. 14b is a block diagram of the Ruler Adjustment Decision Circuit of Fig. 2a. 

Fig. 14c is a timing diagram of the circuit of Fig. 14a. 

Fig. I4d is the logic truth table analysis of the Status Decode Logic of Fig. 14b. 
Fig. 15a is the time zoning analysis of the Adjust Update Control of Fig. 14b. 
35 Fig. 15b is the logic truth table for Adjust Update Control of Fig. 14b. 
Fig. 16 is a block diagram of the Digitally Adjustable Delay. 
Fig. 17 is the timing diagram for Fig. 16. 

Fig. 18 is the block diagram for the 360 Degree Phase Shift Range Calibrator. 
Fig. 19 is the block diagram of one embodiment of the Rotating Phase Shift Control 
40 Rg. 20 is the logic diagram for the register stages of Fig. 19. 

Fig. 21 is the table and chart of equations and the truth table for Fig. 20. 

DETAILED DESCRIPTION 



45 hrninV^ b ' 0Ck diaQram ° f 3 data transceiver "sing locally generated compensated 

broadband time rulers and data transition averaging. 

With reference to Fig. 1a, to transmit data, the transceiver 1 receives N-bit parallel data TDATA on bus 
11. serializes ,t and transmits out the serialized data SERO on line 12. The transceiver receives the serial 
incoming data SERIN on line 13, recovers the data from the output of D flip-flop 15 using a recovered and 

50 I^Tb IT h SB,TC , K !f 3 «W <* ^e compensated bro-^^Sr"S 

line 16. The deserialized N-bit parallel data as the recovered data RDATA are sent out on line 14 A 
reference frequency source generated clock signal LCLK as the local clock running at 1/N of the data 
frwncy ,s taken in on line 31. The accuracy of the LCLK or its frequency is specified in respect to the 
clock frequency of the remote transmitter/receiver. A "compensated broadband" time ruler signal BITCK 16 

55 equivalent to a bit-rate clock, is generated in the compensated broadband Time Ruler Generator and 
Regulator block 30, and output on line 16. This signal is then sent to digital adjustable delay 50, Fig. 16 of 
JL^E ft- haSe , Sh,fter 56 in which P hase shiftin 9 is performed, and from which the phase adjusted 
integrated time ruler signal SBITCK is output on line 16'. The BITCK signal is generated from a local 
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frequency reference source LCLK which has no certain phase relation with the incoming serial data SERIN 
on line 13 and thus can not be directly used to read or sample the data. The serial incoming data SERIN on 
line 13 may be preconditioned, Fig. 1b, to compensate duty cycle distortion by signal pre-conditioner 20\ 
so that the preconditioned incoming data is available on line 13'. Alternatively, the serial incoming data 

5 SERIN can be directly applied to the D input of D flip-flop 15 for data recovery, via connector 13" since this 
scheme has large jitter tolerance and data can be reliably recovered even in the presence of the duty cycle 
distortion jitter. In the latter case, only the two 1/2 Tb time ruler delay elements 22 and 23 in Ruler Delay 20 
are needed. The serial incoming data on line 13" is sent to the cascaded time ruler delay 22 and 23, each 
of which delays for a time lapse equal to half of the bit period. The edge distribution sampler 81 samples 

w the level status of nodes 25 through 29 to capture the location of data transitions while the data signal wave 
is traveling in time ruler delays. Different distribution patterns on nodes 25-29 will be observed if the 
sampling takes place at different phase instances. In this arrangement, the phase adjusted signal SBITCK 
16 f is used to strobe the sampler for capturing the status of the transitions. Since SBITCK 16* is also used 
as the clocking signal for D flip-flop 15 for reading the data on line 13", the most appropriate phase relation 

75 of SBITCK and the incoming data SERIN is that the SBITCK sampling edge is aligned with the center of the 
data eye. If no time jitter is present in the incoming data, all data transition should be captured at or in the 
neighborhood of node 27. In other words, if the clocking edge SBITCK is one half bit away from the SERIN 
data transition in time, it is called "phase in lock". In practice, there are always timing jitters associated with 
data edges. Distributed edges should be observed if a certain number of edges are captured. The sampled 

20 edge distribution information is sent to the phase adjust decision block 82 via bus 85. The captured 
distribution data are analyzed and decisions are made in block 82 to inform the rotating phase shift control 
56 of the direction of the phase error. If the retimed or phase adjusted clock signal SBITCK is not aligned 
with the center of the data eye, commands are sent to the rotating phase shift control of the phase shifter 
56, via line 88 UPDNEN (up/down adjustment enable) and 89 UPDN (up or down) to cause phase 

25 adjustments in steps. The process continues until phase alignment is achieved, and is resumed whenever a 
prespecified phase error due to frequency drift occurs. 

In Fig. 2a, a detailed block diagram is given for the Compensated Broadband Time Ruler Generator and 
Time Ruler Regulator 30. The function of the Time ruler Generator is more fully described in pending 
application 93304079.2. A plurality of cascaded adjustable delay elements 21' which each have delay of 

30 one-half T b are controlled by the up/down shifter 90' on bus 1000. With reference also to Fig.2b, timing 
diagrams, it is seen how the Local Reference Clock LCLK controls the regulation and calibration of the time 
ruler delay elements 21 such that each delay element 21* exhibits the time lapse equal to half of the bit 
period. 5X frequency multiplication is assumed in this example. Pulse generator 34 provides pulse 312, 322, 
etc. at node BT1, 301, at every LCLK rising edge 310, 321, etc., to generated the rising edge of the pulse 

35 with width equal to half of the bit period. Pulse generator 34 Fig. 2a is responsive to the output on line 33 
the first time ruler pulse which is subsequently delayed. The pulse generator 34 output is BT1. If the delay 
value of each 1/2 Tb delay element 21 * is calibrated to its nominal value, at every bit interval, a delayed 
copy of this pulse is generated at nodes BT2, 302 through BT5, 305. These pulses are signals responsive 
to the time ruler which assist in determining the transmitted and serialized data frequency. The pulse is 

40 "traveling" in the serial delay line composed of several cascading such time ruler delay elements 21 ', 
generating pulses in every bit interval at nodes Bt1 through Bt5 and providing frequency information in our 
data recovery process. Error will accumulate as the number of such consecutive bit interval increases. I call 
this time ruler the "bit rate" time ruler, since it is produced from a "byte rate" reference frequency clock 
signal LCLK and which provides time measurement at bit resolution. If infinitely small delay adjustment 

45 could be achieved in the delay regulation process, or if N such bit rate time ruler lapses equaled exactly 
one byte rate time ruler from which they were generated, then the bit rate ruler could be used a large 
number of times. However, N times Tb is not equal to LCLK period due to the achievable adjustable timing 
resolution. In serial link data transmission and recovery, we use the recovered signal or time ruler to 
"measure" the time lapse in the absence of data transitions. If each time ruler had a 1% timing error, a one 

so bit slip would occur after 100 bits. This is especially a problem in an all digital data recovery approach 
since only step or finite time or delay adjustment is available. However, the "byte rate" time ruler is very 
accurate and is not dependent on the adjustment resolution. The conventional accuracy of a crystal 
frequency source permits the use of the time measurement for hundreds of consecutive byte cycles without 
accumulating significant out of spec timing error. With reference to Fig. 2b, the signals from BT1, 301 

55 through BT5, 305 are combined in a logic OR circuit to produce a signal BITCK 16. I call the ORed output a 
"Compensated Broadband" time ruler, BITCK 16. I call it "compensated broadband" because it is a 
combination of two types of time rulers with different base frequency. Note from Fig 2b, that the broadband 
time ruler BITCK 16 acts like a bit rate clock, but is different from a regular bit clock. In the timing diagram, 

5 
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he cycles of BITCK are 5 in a group. Cycle T1 through T4 are identical in time because they are generated 
from delaying a signal by identical bit delay time ruler elements, while cycle T5 may be different than the 
other four if a timing error of the time ruler exists. In a conventional clock, Period T5 would end at the time 
indicated by time line 323 if it was determined by delaying signal BT4 by a time ruler delay element 
Actually, the rising edge of the 6th bit 333 is generated by edge 322, which is generated by the rising edge 
of the LCLK 31 at 321. There is a time gap between 323 and 333, which in fact is the accumulated timing 
error of the time rulers generated in period T1 and through T4. In terms of the time "measurement" of this 
example, bit time rulers are used for determining 4 bits within every 5 consecutive bits, while the byte rate 
time ruler is used to determine the rising transaction for the 5th bit. As a result, the frequency accuracy is 
only limited by the byte rate clock LCLK. With a given frequency offset specification of 100 ppm or so data 
recovery using this type of compensated broadband time ruler can stand a run length of hundreds of byte 
times or thousands of bit period times, i.e. periods without a data transition. 

The compensated broadband time ruler BITCK can be used to sample or recover the serial incoming 
data on l.ne 13 only if its clocking edge can be aligned with the center of the data "eye", show in waveform 
365. In Fig. 2b, BITCK 16 has to be delayed or phase shifted in the amount of Tdd to produce a phase 
shifted copy of the broadband time ruler SBITCK 16', the timing edge (rising edge) of which is aligned with 
the center of the data eye indicated by the dashed lines 344. When this alignment is accomplished the 
time between the rising edge of the phase shifted SBITCK and the average data transition position should 
be half of the bit period as shown in time interval 366. If the phase adjusted or aligned SBITCK is used to 
sample" or to "capture" the data transition positions, a edge distribution centered at half bit time away 
from the sampling instances indicated by dashed line 355 can be expected. 

Fig. 2C is the block diagram of a DCD preconditioner. The function of the DCD compensator 20' is 
more fully described in pending application 93304077.6. 

25 EDGE DISTRIBUTION SAMPLER (FIG. 3) 

In Fig. 3, one embodiment of the edge distribution sampler block 81 from Fig. 1a and its connection to 
the phase adjustment decision block 82 and to the time ruler delay elements 22. 23 are shown In this 
embodiment, SBITCK 16' is used as the clocking signal for the latches 810. which inputs are from nodes 25 
30 through 29. Signal arrived at node 27 is half bit time later than that seen at node 25, and signal arrived at 
node 29 is half bit later than that seen at node 27. Node 27 is named W0 because data edge or transitions 
are expected to be captured around there if the SBITCK is aligned with the center of the data eye or thev 
are "in lock" m^h^o oe no i — * — ... . . . ' " 
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Nodes 26 and 28 are taps from the delay element, the significance of their exact location will 
be subsequently discussed. For the purpose of explanation, I assume that there are taps in the middle of 
the delay element at this time. These five nodes make up several regions in respect to the sample clocking 
edge of SBITCK. Refer also to Fig. 4a for the regions: 

The Inner Region (I) -- region 101 where a data transition is captured when it is between the two middle 
nodes 26 and 28; 

node^ and^e^ Be9 '° n ( ° E > " r69i ° n 105 where 3 data transition is captured when it is between 
The Outer Late Region (OL) - region 106 where a data transition is captured when it is between 28 and 

29i 



45 27; 
29. 



The Outer Region (O) - the logical OR combination of OE and OL; 

The Early Region ( E) - region 103 where a data transition is captured when it is between node 25 and 

The Late Region (L) -- the region where a data transition is captured when it is between node 27 and 

Exclusive OR (XOR) gates 815 are used to determine if a transition is captured in one of the regions 
For example, ,f a transition is in the OE region while the SBITCK is sampling, the sampled binary level at 
so output 805 and 806 would be different such that the XOR output OE 821 is a "1 " If it is "in lock" or the 

San ?. Plm Lo 0Ck o SBITCK iS We " a,i9ned with the data eye ' there should be on| V a single transition captured 
so that 823 or 824 is "1 - and all other XOR outputs are "0"s. A "1 " in either 821 or 823 will make XE 825 
a_ 1 , meaning that the transition is captured in the "early" region. A "1 - in either 822 or 824 will make XL 
826 a "1 ", meaning that a transition is captured in the "late" region. Assuming a transition is captured in the 
55 early region so XE is a "1 " while XL is a "0", the clocking edge of the signal ISBITCK 16" is gated through 
by one of the AND gates 817, and the "early" counter 831 is incremented by one while the "late" counter 
832 rema.ns unchanged due to the fact that XL is a "0" preventing the clocking signal ISBITCK from 
incrementing the late counter 832. After a number of sampling cycles, the "early" counter 831 and the 

6 
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"late" counter 832 received the number of counts equals to the captured early and late transitions 
respectively. The test for finding and registering the differences of the two (called E-L test, meaning total 
early minus total late in a sampling period) are computed in up/down counter 841 and output to the phase 
adjust decision 82 via bus line 87. The same analysis applies to the "I" and "O" region for transitions 

5 captured in the "inner" and "outer" region respectively. The test for differences in captured transitions for 
the two (called l-O test) is computed in subtractor l-O 843 which holds the difference for transitions 
captured in Inner and Outer regions. The subtractor B-S 843 holds the differences for B (in both "early" and 
"late" region or in neither early or late) and S (only a single transition is captured). The test for the 
difference of the two is called B-S test and is also sent the to decision 82 via bus line 85*. When transitions 

w are captured in both "E" and "L" region or none are captured in both region, the outputs of XE 825 and XL 
826 are at the same logic level, so XOR 829 output is a zero, otherwise it is a "1". Counters B (for both) 
and S (for single) will be incremented accordingly. The contents of the counters and subtracters are the 
edge distribution information needed for the phase adjust decision block 82 to make adjustment decisions. 
Fig 5a, 5b, 5c through 8a, 8b and 8c are diagrams illustrating the edge distribution analysis for the 

75 phase adjust decision process. In Fig. 4a, the phase shifted integrated time ruler signal SBITCK 16' is 
shown and the detection regions are defined as described early. Data waveforms are shown in overlapped 
eye diagram format together with histogram information for the edge distributions for rising edges (402) and 
falling edges (403) respectively are shown as an increased concentration of lines where the histogram has 
higher concentration. Row (A) Fig. 4a & 4b illustrates the "in lock" situation (or 0 degree offset) when the 

20 rising edge of the SBITCK is aligned with the center of the data eye as indicated by line 401. Observations 
of the distribution in this case reveals with reference to Fig. 4b: 

1. Almost all the transitions are captured within the "I", 101, region. A few edges in the tails 402* of the 
distribution is within the "O" region. The l-O test result shown in column 111 should take position 
maximum value. 

25 2. Equal transitions are captured in "E" and "L" region because of the symmetry and the alignment of 
the distribution. E-L test result shown in column 112 should be zero. 

3. Only a single transition can be captured in each sampling, so B-S test in Column 113 should take the 
negative maximum value. 

4. Rising edge distribution represented by the cross hatched curve region 402 is centered at the early 
30 ("E") region, so RE (rising early) is much greater than RL (rising late), while FE (falling) is much smaller 

than FL (falling late) since the falling edge distribution curve 403 is centered at the late ("L") region. 
The same analysis technique is illustrated by the situation when SBITCK is 90 degree out of phase in 
row (B), 180 degree out of phase in row (C), and -90 (270) degree out of phase in row (D), each in respect 
to the center of the data eye. Note that if it is 180 degree out of phase, the result of the E-L test is the 

35 same as if it is aligned. Thus this test can not be used to distinguish between 0 degree or 180 degree out 
off phase. However, the l-O number is maximum with a change in sign, and will be a reliable test to 
distinguish the two. It also noted that the result of the l-O test in column 111 is the same for the event of 90 
degree or -90 degree out of phase situations, thus can not be used to differentiate these two states. This 
information is needed to determine the direction in up/down type of phase adjustment. However, E-L test in 

40 column 112 has the maximum opposing number for these two, so the E-L test is obviously the test to 
extricate if the SBITCK is leading or lagging the data. From the above analysis, it can be seen that test E-L 
or test l-O are a function of the SBITCK phase offset, and the two functions are 90 degrees out of phase. 

In Fig. 5 through 9, variations in width of the "I" region as well as the peak-to-peak value of the 
captured data transition distributions are considered. Fig. 5a, 5b and 5c illustrate the l-O test results when 

45 the width of the "I" region is large, medium and small respectively. The width of the "I" regions are shown 
in shaded areas. Since the l-O test is primarily for differentiating 0 degree offset and 180 degree offset, 
large "I" region width represented by l-O curve 551 in Fig. 5a has a greater slope near 180 degrees and is 
much more appropriate for the task than the small "I" region width represented by curve 553 in Fig. 5c. Fig. 
6a, 6b and 6c are for the same situations except that the total data edge distributions are depicted in 

so standard Gaussian Distribution with a medium peak-to-peak value of about + - 30% of the bit period. The 
same conclusion can be reached that large n l" region width should be used. Fig. 7a through 7c show the 
situations when the peak-to-peak value of the total edge distribution is large and Fig. 8 shows the situation 
when the peak-to-peak value of the data edge distribution is small. Summarizing, for all cases, a large "I" 
region width will make the distinction between no offset and 180 degree offset easier. Fig. 8a through 8c are 

55 for the situation when the peak-to-peak value of the edge distribution is small and the same conclusion can 
be reached. Fig. 9a - 9e summarizes the above discussion for different test and peak-to-peak value of the 
edge distributions. 
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«mf 5 u 6d9e distributions for three diff erent peak-to-peak values. Curve A B and C are for 

the E L m 7 t Sma "' mediUm ^ ,ar9e P eak " to -P eak values respectively. In Fig. 9b the Result o 

nLt < L k UnC . tl0n ° f th9 SBITCK 0ffS6t in res P ect t0 the mean <* the data edge distribution is 

« Eck tte A7abs e ol 0 Thr t es r ho e .d aSeS n^T"^ ThiS ^ &L * a ' S ° US6d l ° dete ™ e if ^e drc . Ts ,n 
lock . An absoUte threshold number d can be specified which is the maximum number allowed for thP 

oS n ?f 6 L ^ ^T' ° f ^ " ear ' y " «*» and the number of the "'ate" edges Ts samp.ng 

E L exceeds d and" uD-^r-r "° ^ M '° be Perf ° rmed " When ^absolute vaT o'f 

l^rTeoenTon if E L has Sh ' ft,n9 ° f ^ inte9rated time mler iS 6nab,ed - The u P /down *W 

on ae Penas on if E-L has positive or negative value. Note also in the fiaure- that the. rth^ 

w alignmenter for pA, P B and P C are different for the same threshold vaL d because o d fflrent data 

transition distributions. Data edge distribution with smaller peak-to-peak value as cu^e A has le^faLnment 

error, P A. D.gital filter functions may also be included to reject the effect from temoor^ SL« S 

« tZTJ J h- ?.r ,fl : 9C ' the r6SUlt ° f '-° test as a function of t^ phase offset is plotted for the hree 

Phase llSn 6 STe^CK^T^ — T ^ *' abS °' Ute * ^ iS th-ho'd % " 
?h^?n tit ^ the d3ta IS e ' ther Close to 0 de 9 ree or in the neighborhood of 180 degree 

=== 

results .mmed.ately ,n a response in frequency adjustment of the VCO. Low pass filter for sm^tS^ 

srs: ls rsz in n s , uch r al ° 9 p h LL ,oops to reject hi9h ^tsc 

frl,?l „ * 2 ' reta ' nmg the l0W fre£ l uenc y Phase variation information represents rea 

, Z 7Z y ™ P f dnftin9 - ln thiS di9ita ' appr ° aCh ' h0wever ' the ed 9* averaging mecromlJ^SSeS! 
If a PaS ' f, i en ? 9 f unct 'onality, since phase adjust decision command is not generated based on 'chase Tor 
o an individual edge, but rather on an averaged edge position or the average of the ^phase errors to^Z 

ESeHrP™^^ 

Furthermore, van.ble value adjustment steps may also be used depending upon the maoMud. of Z 
phasa error detected, and on the quadrant of the phase offset. A -FS- output signaUFas4tow 1* 
So, ml *° t 9 * ne " aMd: " *" Phm di "" en ~ te "«»• 'naHlTgleroMHa,,™* 

SB^S^aL^^^ 

samp ng pomtt 25 through 29 becomes the left group aril sampling points tsZl^tST^Z 
BIT SERIALIZER (FIG. 12) 

P^TdaS^JL^h ? '"V^ th9 bit S6rialiZer 30 ° f Fi9 " 22 is disclosed *hlch serializes the 
Z ^r*£ 7 L Ch Pr ° V,deS the " Com Pensated" clock called BUCK, Parallel data TDATA (+0) 

are latched mto the mput register and are valid at TD1 through TD5 at the rising edge 5 DL before le 
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rising edge of BT1 . TD1 through TD5 are complements of TDATA (4:0). Node 33 at the input of invertor 33' 
is pulled high by a weak-drive device 35 if none of the MOS switch SW1 - SW5 are on. When the pulse of 
BT1 arrives, it turns on MOS switch SW1 for the duration of 1/2 Tb. Since output TD1 , to which SW1 is 
connected is low, MTR or node 33 is pulled low for 1/2 Tb. This causes T-FF 32* to produce an output 
5 transition of SERO at 12. A half bit later BT1 returns to tow, turns SW1 off, causing XTR on node 34 to 
return to zero. This sequence is repeated as a gated readout of the register. If one of the bits in the input 
register is zero, no transition will occur at output SERO for that bit. This operation is more fully described in 
my earlier copending application, entitled "Digital Data Serializer and Time Delay Regulator", Serial Number 
07/901,312 filed June 19, 1992, AMD Docket A865. 

10 

DELAY UNITS (Fig. 13a) 

The delay line 21 of Fig. 2a which provides the time ruler signals BT1-BT5 is made up of a plurality of 
cascaded adjustable delay units 21 each of which nominally provide a delay of 1/2 Tb. Fig. 13a shows that 
15 the adjustable delay unit 21' is preferable made up of two parts: a course step unit 211 and a fine step unit 
212. The course step unit 211 is formed by two parts: a fixed-stage delay 221 and a variable-stage delay 
222. There are k stages of delay cell 250 (d1 through dk) in the fixed-stage delay 221 and m stages of 
delay cell 250 for the variable-stage delay 222. One of the embodiments of delay cell 250 is shown in Fig. 
13b, which employs 3 invertors. The delay from IN to OUT is defined as T step , while the delay from IN to 
DOUT is defined T dop . The delay from the input of the delay cell DV1 (223) to the MUX 225 output DLYM of 
the variable-stage delay 222 depends on the digital command code on line 232, which selects only one 
path of MUX 235 from MXI through MXM to DLYM. Assuming that delays in MUX 225 from MX1 through 
MXM to the output of the MUX 225 are all the same (T mux ), and the digital command code selects the path 
of MXI to DLYM, or if i delay stages are selected in the variable stage delay 222, the total delay of the 
course step unit 21 1 can then be obtained: 



Tooars. = K (T^p) + T dop + T mux + i (T^) 
30 = T int + ifr step ) (i=1.m) 



where T int is the "fixed" part of the delay unit, or its intrinsic delay. The value of parameter k and m are 
chosen to guarantee that under all processing and operating conditions that the unit delay element 

35 adjustable range is large enough to be able to produce the nominal delay required (l/2Tb). T step , the delay 
of the delay cell 250, is the base delay or the minimum adjustable delay of the operation which determines 
the resolution of the operation. T int , is the intrinsic delay of the delay unit which will determine the up-limit 
of the operating frequency. The delay unit 21 could be built by just using the "course" step unit itself if 
both T step and T int are satisfactory for the frequency targeted. The addition of a fine step unit 212 can be 

40 beneficial for both higher frequency operation and for lower frequency or larger frequency range operation. 
For the former, a even finer adjustment than T step is achieved in the fine step unit 212 to improve the 
resolution to reduce the timing error. For the later case, T slep can be chosen a larger value (10x for 
example). Fine step unit 212 can use exactly the same structure as the course step unit 211 with its base 
delay cell delay being a finer value (one 10th of T step for example), or it can be built with a different 

45 structure. In any case, it is the digital command code on bus 232 that also controls the delay of the fine 
step delay unit 212. Assume for this example that a coarse step adjustment is applied to each delay 
element. If a total of 10 (2N =2x5 = 10) coarse step adjustments are applied to the delay unit at the same 
time, a total delay range adjustment of 10xT step will result for the whole delay line. Another requirement for 
the delay units is a matched signal rise-to-rise and fall-to-fall delay to ensure a non-distorted pulse width 

so while a pulse is traveling through the delay line. Inverter 213 is used for inverting the polarity of the signal 
from DLYIN 210 to DLYOUT 220, so for every other delay unit (or a bit time delay) the signal is not inverted 
and that any mismatches in the rise and fall delays inside the delay unit will be canceled. 

With reference to Fig. 2a, the digital command from the Up/Down Shifter 90 f on line 1000 controls each 
of the delay units 21 and 21' such that the delay of each equals to half of the bit period (1/2 Tb), accurate 

55 to ± one step in the delay unit, provided that all the delay units are identical and that proper codes are 
chosen at any particular instance. 

With reference to Fig. 2a and Fig. 14a the delay line 21 detection is performed by "sampling" the level 
status of the delay units by a registration process which looks at each monitored node of delay units from 
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fh T n 3 < 5 in OU9h l 61 ^ UnitS 501 thr ° U9h 506 ° f phaSe detector and of P°* delay 60'. The monitored node is 

ho° 2 Z to^VS TJoTnaT- b r?J tW ° C0nS6CUtiVe n ° deS in -V or^e e con 0 d on 
edge of the LCLK to JrJT 9 "" ,St f s /° sam P les the 'evels of each monitored node at the rising 
eage of the LCLK to locate the position of the previous DTRCK rising edge Alternatively the task Si 

::zvixz:^s Tr risin9 ed9e cou,d ais ° be tszz ii* 
rsssinr- position of the previous risi - ^ ^ ™^rr s ^ 

value"!? ITrb wh^n r b ° dime : t ° f Fig " 143, " the de ' ay ° f the dela * units is very «ose to the nominal 

™ «^ZL^lZ"JT h DTRCK / rriV9S 3t lnPUt t0 ° e,ay C <-P-*>tion 52', the previous 
' u rising edge should be somewhere around or near the nodes at the interface of delav iin P 91 =>„h 

the levels I of S IZSL T T ' M,ng * fiSing ed9e position is ° bserved * this region when 
the levels of the monitored nodes are latched into the D type Flip-Flop registers clocked bv DTRTK Z 
region , s where the timing or phase difference between the sampling cloc DTRCK on L 

delays are correct when sampling takes place. If the delay of the delay units 21 ,s not ex^ C 1/2 Tb ,hSn 

on either the - - <™ or onrs^:: 

25 throu^/riT ^ " *** ^ ^ ^ * te 3,S ° US8d in the pha - detection delay 60' (501 

We group the inputs to the sampling register 70 1 from different nodes of the delay line 21 from Ph»«> 
»^ 

- takinj p ce when ft? sam ST 001* JX^ T^f ^ ^ ^ Wh ™ 3 "° "> 1 " 

re,ative D y narrow J ^^0^^^^ ^ ^ 

srs^£rr. n ^ ta sm r than the puise width ^^xrK: 

the DFPs t thiT C t ha H 9 , e t ° r tranSiti ° n ° CCUrS Wh6n the Samplin 9 r ^ ™ '^ches L^e status into 
delay of he delav Ttl 21 Chan96 ° r Supp,y power noise «—» delay variations sotat the 

delay of the delay units 21 ,s no longer equal to half of the bit period, or the total delay of delav lin* ?1 i £ 
no longer equal to the byte period, the transition detected will be on one ide of ttJ wrl 
indications whether the de.ay of the delay unit 2V is shorter or longer than the nornfna, I e ^ 9 ' Vm9 
<° Fg 14c shows the waveforms at the input of the sampling register 70" Fiq 14a and th* .tat,,* 1. 
sampled for d.fferent timing conditions. Line 531 shows the sLus when the delay un* ar -<2iS? to 

dtSckTs'"!!" 6 h or h the de,ay , ,ine is " in -' ock " Line 532 Sh0ws the stat - ^eTZs^TLX 

s,», The o, ,» phase „.,«,„„ detey 6 J t0 _ de|ay .h.^.pjc 
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at the window center position. The additional delay caused by delay units 501 through 503, as well as the 
setup time of the D flip-flops in sampling register 70, is to be compensated by the delay compensation 
block 52'. 

The calibration of the delay unit, in operation, includes two modes, the initial "power-up" mode and the 

5 normal operation mode. These modes are more fully described in copending application 93304079.2. 

Fig. 14b is the block diagram of one embodiment of the Ruler Adjust Decision Circuit 80 f (Fig. 2a). The 
inputs to this circuit is the status code outputs from the sampling register DST(j:1) which are provided by 
the sampling process once every byte cycle. Complex gate 84* performs the function of rising transition 
detection. Only if the two inputs to the XOR gate 82 are in different logic levels and the one on the left is a 

to logical one, then the output of the AND gate 83 will be high, indicating a "0 -> 1" transition or a rising edge 
transition detected between the two consecutive nodes 801 and 802. There should be only a single "1" in 
the XDST (j:1) input bus lines for any sampled code since there is only a single rising edge in the delay line 
21 per sampling cycle, if the delay units are calibrated to or close to nominal value of 1/2 Tb such that the 
whole delay line 21 delays a time equal to the byte period. If more than one "1" exists in XDST(j:1), a 

75 "noise" condition, or a "locked-on2x" condition (meaning that the delay value of the delay units are close to 
Tb instead of 1/2 Tb) is indicated. The Status Decode Logic 85' decodes the status code XDST(j:1) into a 7- 
line detection code on line 851 through 857. 

Fig. 15a and 15b illustrates the function of the Status Decode Logic 71. Line 800, Fig. 15a indicates the 
"window center" position. If the detected transition is within the region of -0.5 and +0.5 delay cell from the 

20 window center or in the zone D, E, XWC is "1". If the detected transition is in the region of one delay cell 
on left (right) side of the window center, then XD1 (XU1) is logic "1 ". If the detected transition is within three 
delay cells on left (right) of the window center, XD3 (XU3) becomes logic "1". If the transition is in the 
region one delay cells away on left side of the window center XD (XU) is a logic "1". Note the overlaps of 
the regions. The status of this detection code combination gives the information as where the transition is 

25 detected and how fast this detected transition is moving from byte cycle to cycle. Both are useful when 
implementing a digital filter function. The Adjust Update Control block 72', Fig. 14b, is not a pure traditional 
combinatorial logic circuit, because the output of this circuit is dependent on the current input code 
combination, and on the history of the input/output. Also, in high speed data communications, the byte clock 
rate is in the range of ten's of MHz. As a result, the status sampling rate is much higher than the delay 

30 variation rate caused by normal temperature drift or supply voltage variation. It is necessary to "filter" out 
the digital "noise" in the status codes, and to ignore the temporary delay changes caused by high 
frequency power supply noise. Different digital structures can be used to implement the filter function. 
Generally, the filter function should remove or ignore rapid phase variations caused by rapid electrical 
noise, while passing the slow and steady phase changes caused by slow delay changes to the controls of 

35 the Up/down Shifter 90 to make proper command code adjustment to maintain constant delays of the delay 
units. 

Fig. 15b shows a function table for implementing the Adjust Update Control 72, Fig. 14b. Columns 901 
through 907 are the status code inputs from the Status Decode Logic which is sampled for every byte 
clock. Columns 908 through 910 are three intermediate variables. Outputs are shown in column 911 through 

4o 914 where UD and ROTEN are the control outputs to the Up/down Shifter for delay adjustment. ROTEN = 
1 enables the adjustment when control update is necessary. UD gives direction indication as how to make 
adjustment (UD = 1 for step up and UD = 0 for step down) to increment or decrement the delay of the delay 
unit. FS is the switching control between Fast adjust mode and Slow adjust mode. RESTART = 1 enables a 
reset process in the event of a consistently detected error status. 

45 With reference to 15b t row (a) shows the situation when the detected transition is in zone H, Fig. 15a 

right of XU3, as in the case of power-up. This is the zone for INC-DLY-FAST mode (UD = 1 and FS = 1), 
meaning delay of the delay unit needs to be incremented with large (coarse) step to be increased fast. This 
fast mode corresponds to a smaller loop time constant as in the PLL case. In row (b) and (c), the detected 
transition is still on the right side of the window center, so incrementing the delay unit is still necessary. 

so However, assuming the transition is already within + -3 delay units, or +- 3xT step from the window center, 
in zone G in Fig. 15b, no fast adjustment is to be made in this instance since each such adjustment of a 
delay change of SxTstep is too large and may "overshoot" to zone A on other side. This is the zone for 
INC-DLY-SLOW mode (UD = 1 and FS = 0), meaning delay unit incrementing is made with fine steps which 
correspond to a large loop time constant as in the PLL case. Row (d) and (e) indicates the IN-LOCK 

55 condition, which means that the total delay of the delay line is within +- 0.5 T slep comparing with the- 
reference byte clock. No adjustment is necessary for this condition (ROTEN =0). Rows (f), (g) and (h) 
indicate similar situations as in rows (a), (b) and (c) except that the delay of the delay unit needs to be 
decremented (UD = 0) instead. Fig. 15a provides a means to understand the overall operation of the adjust 
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update control 72 relative to different zones in which the transition is detected. Row (i) and (j) shows error 
status ,n which e.ther transitions are detected on both sides of the window center, or none is detected 

£%^TZ£f?£f" reSUlt ° f diQita ' n ° iSe in th8 St3tUS Samplin 9 P rocess ' and sh °" ld therefore 
be ignored The RESTART becomes active only if the errors are persistently detected. Another error status 

of i 2 Th U A ' J\ °r£? Stab ! e / an9e ° f th9 de ' ay Unit iS lar 9 e enou 9 h that ^ delays time interval of Tb instead 
aZ f A ' N --° C K c °ndit.on could be erroneously resulted since one transition should be detected in the 
middle of the detay l.ne 20, while a second transition can be detected around the window center This is 
indicated m row (k) of Fig. 15b as a LOCKED-ON-2x condition, similar to the case in PLL when the PLL is 
locked on the second harmonics of the reference clock. One solution to this problem is to restart the delay 
cal.brat.on process by setting the control code to the delay units in such a way that all the delay units select 
minimum delay value as the initial setting. In doing this, the transition is first detected on the right side of 
the window center to avoid the L0CKED-0N-2x condition. Note the in the table in Fig. 15b, which means 
the controls become activated only if a certain condition is satisfied. For example, if the last value for UD is 
0 (meaning delay decrement) while the current status code changes to XDN =0 and XUP = 1 (meanino 
delay increment), one option is to change UD immediately to "1" if it is in the slow or fine adjustment 
mode since only a small step adjustment can be made regardless of how big the delay adjustment is 
.nd.cated by the detection, just like a fast spike on the phase detector output 7s "smoothed' by a 
capacitor m the low pass fitter. On the other hand, if it is in fast adjustment mode while a sudden change to 
director. ,s indicated, further detection results may be needed before a reverse direction adjustment is 

Zf,tJL an LT e \. th9Se tyP6 ° f di9ital ,iiter functions can be implemented to make correct delay 
adjustment with less timing error or resulting jitter. 

DIGITAL PHASE SHIFTER (FIG. 1a, FIG. 16-21) 

^i/ h r di9 t ita ' PhaSe ° f my invention ' as indicated a b°ve, must act as if it were an infinitely long 

tJn, J °, ac , CO T° date the fact that there may be a slight and constant difference between the 
frequency of the local clock of the receiver and frequency of the transmitter 

DhJ^hmfw 6 Shift6r U ! MiZeS 3 taPP6d d6lay " ne and 3 MUX 10 f0rm a di 9 ital| V ad j ustab,e d *lay to provide 
££S fl Tf S ^ mPUt S,9nal - An Up/d ° Wn COntr0 ' Si9nal controls the increment or decrement 
eth h T 7 S ° ^ 3 P0SitiV6ly ° r ne9atiVely P hase - Sh ^ d copy of the input signal is selected 
dele In. 6 TT° ' S t enableC \ The ad i^table delay is able to provide phase shifting in 360 

S„ h 9 V ° Perat,ng COnd,tions - An on - tne - f 'y Phase Shift Range Calibrator detects the delay 
variation due to temperature or power supply variation and determines the output tap known as the End 
Stage Position (ESP) in the digitally controlled delay from which the input signal is de ayTd exactly 360 
degree. In the event that the up/down control indicates that a constant positive (up) phase shiftTs deeded 

ES^aThedT ? 1 ^JT" "* **" ^ "■ ecHon ™ tap " P every enab.ed clock unt!l the 
ESP is reached, f a further up-sh,ft ,s still mdicated, the phase shift control will "wrap around" to select the 

the coTn ti " 7th t 3 P ° SitiVe Ph3Se Shm St6P iS made in relative to the las * Election where 

the control selects the tap output stage indicated by the phase range calibrator as the ESP The same 

ZtTnZ " Pe h rf ° rme h d « J* 81 ' ^ UP/d ° Wn COntr0 ' iPdiCateS that 3 ne 9 ative < down > P»«» ^ift' is needed. 
JiLtT 9 P sh '" ,nG -. rec ' u,res on| y an uP/down and an enable signal from the phase detection and 
adjust decision logic The digitally controlled or selected delay is adjusted one step at a time and by virtue 

l^LTT ' IS de J 9ned t0 be aWe t0 " freeZe "' ° r h0,d ' a P° sition in case there is no data to 
compare. The previous phase adjustments are also able to be "memorized" by the control logic which is 

fm™ S6n f UP/d ° W " COntr0 ' Si9nal ' SUCh * hat C6rtain ad J us tment "patterns" or some modulating 
frequencies from noise sources may be recognized and filtered by the control logic 

W.th reference to Fig. 1b the digitally adjustable delay 50 receives BITCK 16 and performs phase- 

T idin9 ° Utp l from SBITCK 16 ' w "ich is the phase-shifted copy of the original signa. 
BITCK 16, as shown also ,n F,g. 2b. Waveform SBITCK 16' is a phase shifted copy of waveform 16 by any 
S^CcZ™' p , PhaSe D f ift is c°ntrol.ed by the parallel digital control code 68 from the Rotating Phase 

k.¥o!? 9 336 Sh,ft C ° ntr01 60 ° nly requires a si 9 nal UPDN 89 and UPDNEN (up/down 

enable) 88 rom the circuit which informs this circuit to provide positively increasing phase shift (UP) or less 
or negatively increase phase shift (DOWN) form the current phase shift amount. UPDN 89 is "1 " when 
further positive phase shift is needed, and is "0" when negative or less phase shift is desired. UPDNEN 88 

Iv* 6 , nMh T 1 u 8nab,in9 , * 9nal - UPDNEN iS 3 " 1 " " the ad i"stment needs to be performed upon the 
arrival of he clock rising edge, and is a "0" if current phase shift is to be maintained frozen Phase Shift 
Range Calibrator 70 Fig. 18. constantly monitors on-the-f,y the delay change due to variations in operating 
conditions, and sends a parallel digital code 79 to the rotating phase shift control 60 to notify the posftion of 
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the stage in the delay line where the original signal is delayed a full cycle'or 360 degrees. RDY 36 are the 
signals from other circuitry to signify that the calibration is ready to be performed. RES 32 is the reset 
signal for resetting the controls during the power-up sequence or in other occasions when a re-start is 
necessary. IRG 361 is the In-Range signal when the phase comparator 73 finds signal PHO 711 and PHK 

s 712 are almost phase aligned, Fig. 18a. 

Fig. 16 shows one embodiment of the Digitally Adjusted Delay which consists of a series of cascaded 
inverters 509 and MUX 541 or other fast path selector. The input to the MUX are the taps from the delay 
chain for every other inverters via lines 511, 512, etc. for a total of N pairs of inverters. The control inputs to 
the MUX, S(1) through S(N) 68, are outputs from the Rotating Phase Shift Control 60, Fig. 19 and Fig. 1a 

10 which are all "O's" except one "1" which selects a corresponding delay path. The control inputs S(0), which 
would select input on line 510 if it is a "1", is tied to "0" to disable the selection. If stage DK, Fig. 16, is the 
ESP, i.e., is found to be the stage at which signal BITCK is delayed or phase shifted a full 360 degrees, the 
delayed signal at the output of the stage DK on line 515 should be coincided with that at the input on line 
16. This can be better seen in Fig. 17, the timing diagram of the delay, where the output signal SBITCK(K) 

rs of 515 is coincident with signal BITCK 16 on line 16. A next positive phase shift step can be achieved if the 
selection moves to stage 1, shown as SBITCK(1) 511, where a phase shift 411 equal to the delay of two 
inverters is shown. It is seen that an endless positive phase shift can be achieved if the digital control 68, 
Fig. 16 steps through from S(1) to S(k) then rotates or "wraps around" to S(1) and so on, where S(K) 
corresponds to the 360 degree phase shift end stage or the ESP. In an integrated circuit format, the delay 

20 of the inverters, in the Digitally Adjustable Delay 50 varies with process, environment and power supply. For 
a given cycle time or 360 degree phase shift, K could be a different number due to above factors. To 
acquire accurate 360 degree phase shift rotation, the delay of the inverters have to be constantly detected 
so an accurate ESP location can be maintained for calibration. 

The block diagram of the Phase Shift Range Calibrator 70, Fig. 1b, is shown in Fig. 18. Two digitally 

25 adjusted delay's 50A and 50B, constructed exactly the same way as that of 50, Fig. 16, are used to 
calibrate the delays to find the ESP location. The controls to delay 50A are set such that it selects the first 
path of the delay by connecting S(Q) to "1 " while the rest to "0", thus acting as a "dummy" delay. The 
selection for delay SOB is controlled by parallel digital code 79 from the delay adjust up/down shifter 75 
which is in turn controlled by the up/down signal UD 77 and adjust enable signal UDEN 78 from the phase 

30 comparator 74. Controls S(J) through S(N) determines the calibration rotation based on the assumption that 
at least J stages are required to delay a full cycle for a given signal in any process or operating conditions. 
At initial power-up, the control may be reset to a pre-specified code to select a delay path, which is often 
not the correct path to delay the full cycle or 360 degree of the signal. The phase difference of delayed 
signal at PHO 711 and PHK 712 is detected by the phase comparator, and an UD signal is generated to 

35 signify the delay adjust control up/down shifter to add or subtract delay stages to reduce the phase 
difference. If, after a number of adjustments, signal delayed by delay 50A at 71 1 and signal delayed by SOB 
at 712 are in phase, the delay path of delay 50B is the path that provides a 360 degree phase shift o the 
original signal delayed by 50A. The code 79 for selecting this 360 degree delay path contains a single "1 ", 
the position of which defines the ESP location, or stage DK corresponding to C(K). The difference of the 

40 delay path in 50A and 50B is the delay of pure K delay units consisting of 2K inserters, since the other path 
delay or the "intrinsic" delays for the two are the same and should be concealed out. The output code 79 
(C(J) through C(N)) contains the ESP information and is used to inform the rotating phase shift control 60, 
Fig. 1b of the ESP location. The phase comparator circuit 73, Fig. 18B could be any phase detector circuit 
which detects the phase difference of signal PHO at 711 and PHK at 712, and which ensures that correct 

45 delay adjust control signal UD77 and UDEN 78 are generated such that phase alignment of PHO 71 1 and 
PHK 712 is eventually achieved and that the phase difference is 360° or the delay difference is a full cycle 
of signal BITCK. Fig. 18B is a preferred embodiment of the phase comparator 73 for providing the delay 
adjust up-down control signal UD 77 and UDEN 78. The phase comparator circuit 73 is more fully 
described in the simultaneously filed related application, "Digital Variable In-Lock Range Phase Compara- 

.50 tor", which has been incorporated herein by reference. 

Fig. 19 is a detailed block diagram of the Rotating Phase Shift Control 60, Fig. 1b. It contains flip-flop 
register stages ST(1) through ST(N) to form the control of a bi-directional, 360 degree phase range rotating 
shifter, where each stage controls the selection of its corresponding delay path in the digital adjustable 
delay 50, Fig. 1b. If its output is a "1 the delay path or the corresponding MUX in digital adjust delay 50 is 

55 selected. After the 360 degree phase shift range calibration, a bus having code 79 is generated from the 
Phase Shift Range Calibrator 70, Fig. 1b. as described above. The code is shown in box 601 where the 
location of the "1 " is indicated by an arrow 603. In the example shown, stage ST(K) corresponds to the 
ESP. Positive or "up" incremental phase shift adjustments are executed in a rotational sequence indicated 
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^ >O i°L 60 ^ A " 1 " (indicated in the out P ut code shown ^ box 602) is shifted to the right at each enable 
clock CK Th.s continues until the "1 " is shifted to the ESP stage ST(K) such that S(K) becomes a "1 - This 
means that the signal has been phase-shifted for 360 degrees by this code selection. A further positive 
phase shift request from the decision circuit should cause the "1" to be wrapped around to stage 1, or S(1) 
5 will become a 1". and then right-shift again. Note that this rotating shifting takes place in the range of ST- 
(1) through ST(K). as the stage ST(K-M) is "blocked" by the ESP. In practice, when the "1" is shifting 
while the stage ST(i) is away from the ESP, the detection or the exact location of ESP is not important and 
the cal.brat.on to find the ESP needs only to be made in relatively long time interval, or when the "1" is 
approaching the ESP. The error of the 360 degree range calibration due to limited adjustment resolution will 
10 cause an uneven adjustment step at the "wrap around", and results in timing jitter on the "phase aligned" 
or recovered clock in terms of data recovery. Negative or "down" decremental phase shift adjustments 
are executed in opposite direction. Note in Fig. 19, that stages ST(1) through ST(J-1) 62 can be simpler in 
s ructure than those of stage ST(J) through ST(N) 63, because there is no "wrap around" needed for those 
stages. As explained previously, in any operation conditions, there is always some minimum number of 
states requ.red to delay a full cycle of a given frequency signal. Apparently also, the rotation for finding the 
ESP location needs only be performed for stages ST(J) through ST(N). The clock SBITCK 16' for shift 
operation is gated by UPDNEN 88. Proper delays 61 are included to ensure that the signal transitions are 
comp eted from the stages where the delay path is switching to maintain the signal integrity. Line 606 feeds 
,„ ^ k the ° Utpt ? ° f ? 2> t0 the ESP sta 9 e selected in th * event that a "down" or left shift is needed while the 
20 1 has been left-shifted in stage ST(1 ). A "down wrap around" will have to send this "1 - back to the staqe 
where the ESP is located. In normal operations, when a "0" is in ST(1) while a down shift is requested, the 
0 .s also wrapped around to the ESP stage via the same line 606. Outputs WA(I) of all the register stages 
63 are ORed ,nto signal WA or as S(0), which is the input to stage ST(1) in the "up" or right-shift 
operates. In any event, only one "1" is allowed in all the outputs S(1) through S(N) to ensure that a single 
phase sh.ft.ng delay path is enabled. No "1 - should be in outputs on right side of the ESP stage ST(K) 

Fig. 20 shows a preferred embodiment of the register stages in Fig. 19. Register stage 62 is one of the 

-n n ? P S£ 9 J?S r iS US8d f ° r Stag6S ST(1) thr0U9h ST(J - 1) ' De Pending on whether UPDN 89 is a "1" or a 
0 MUX 612 selects input from the output on left (previous) stage on line 625', or from the output on right 
(next) on l.ne 611'. When the gated clock CLK 16" arrives, a "up" or "dawn" shift is performed. Register 
stage 63 .s the type of register with combinatorial logic used for stage ST(J) through ST(N) Fig 19 
Register stages 63 are the same as register 62 with addition to register function 62 of a logic block 614 It 
takes the output of MUX 612 on line 613 for selecting inputs from either the previous stage S(l-1) on line 
625, or from the next stage S(l-1) on line 611, depending on control UPDN on line 615. Other inputs to the 
logic are: the inputs 79 from the phase shift range calibrator 70. Fig. 1b and Fig. 19, both for the current 
stage C(l) on l.ne 617. and for the previous stage on line 616; and input from stage one S(1) on line 606 
The logic for Fig. 19 is easier to understand by referring also to Fig. 21. which contains the equations and 
its tru h table for the function. For each register unit ST(I), there are two outputs: S(l), and WA(I) S(l) 
controls the corresponding input of the digital adjustable delay 50, Fig. 1b WA(I), 608 is the feedback signal 
to stage one ST(1). In the truth table shown, row 701 through 704 is when "up-shift is needed (UPDN = 1) 
while row 705 through 708 is when "down"-shift is required. In row 701 and 702, C(l) is a "0" meaning that 
stage ST(I) ,s not the ESP stage, so normal shift is performed. The status of the previous staqe S(l-1) is 
transferred to the D input of the D flip-flop D(l). and will be latched in S(l) upon the arrival of the clocking 
? Jt?.w u ' -I* mPUt t0 Sta9e ST0) ° r WA(I) iS always a Row 703 an d 704 is for the situation 

ohaT.thlrf^n f St39 f ( ' = K ' C(l) = 1> ° r the 6nd Sta " fr0m which the si 9" al has a "eady been 
phase-shifted 360 degrees. In row 703, if the previous stage is a "0", the "0" is wrapped around to ST(1) 

since WA(I) is a CP, while the current stage is the ESP (C(.) =C(K) = 1), the "1" is not permitted to move 

It nn 6 <5. ( 1 V ), .«» t rather l h3S *° m0ve t0 tne ,irst sta 9 e ST < 1 >- A " wa P ^ound" is performed here by 
mak.ng S(l + 1) a 0 upon the next clocking edge of CLK 16". This can be seen on row 704' where the 
current stage is the stage following ESP, since C(l-1) = 1. During cycle n, if the ESP stage is '"1" (S(l-D- 

%«L Z^*lt\ ( r" t,* 9 T ent Sta9e Wi " be S8t 10 "°" < S <'Wi> = 0). preventing the "1" from 
sh.ft.ng beyond ESP. Meanwhile, at the clocking edge of CLK 16" on cycle n, both WA(l) (n , and S(l) (n) have 
been set to "1 (row 704), i.e. S(1) will be set to "1 " upon the arrival of clocking edge of cycle (n + 1 thus 
comp etmg the "wrap-around" "1 " of ESP to stage one or S(1). Row 705 through 708 is for the "down" shift 
operatoons (UPDN = 0). Row 705 and 706 is similar to the case of row 701 and 702 except that they are 
left or down shift operations. Row 707 and 708 is the situation when ST(I) is the ESP stage or l = K This 
t.me whatever is in stage ST(1 ) will be "wrapped around" to ST(I) (l = K). WA(I) is "don't care" since these 
are down-shift operations. The equations for S(l) and WA(I) are shown on top of the truth table At power-uo 
a reset .s made such that all stages are "0" except ST(1) which is a "1", so a minimum phase shift is 
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chosen. Meanwhile, the phase range calibrator resets C(J) to "1" while others to "0", so the stage J is the 
ESP to start. The ESP is expected to move "up" to its correct position after the phase range calibrator 
completes certain number of calibration cycles, at which a IN-RANGE signal IRG 361 is generated,, and is 
sent to the Rotating phase shift control block 60, Fig. 1A, to initiate phase adjustment process. The UPDN 
5 signal for the delay path selection should choose to up-ohift at the start, so a "1" at outputs S(l) is moving 
"up". In any case, no "1" is allowed to be at the outputs beyond the ESP. Equation (A) is for stage 1 
through stage J-1, while the equation (B) and (C) are for stage J through stage N. Circuit implementation 
can be easily achieved by using conventional combinational circuit design methods or by using a logic or 
circuit synthesizer. 

10 

Claims 

1. An all digital method for tracking and aligning the phase of a binary receiver reference signal to the 
phase of incoming serial binary data stream, wherein said incoming serial binary data stream has a bit 

15 rate which is derived from a distant asynchronous oscillator having a specified nominal frequency and 

tolerance, comprising: 

sampling the transition edge positions of transitions of said incoming binary serial data stream; 

digitally analyzing said transition edge positions to determine the average edge position of the said 
incoming binary serial data stream with reference to a local reference signal by comparing said 
20 transition positions to a phase adjusted local reference signal having the same said specified nominal 

frequency and tolerance as said distant oscillator. 

2. The method of claim 1 including providing digital command signals to shift the relative phase of said 
binary local reference signal to reduce the phase difference between said incoming serial binary data 

25 signal and said binary local reference signal. 

3. The method of claim 2 wherein said step of providing digital command signals to shift the relative 
phase of said binary local reference signal includes the step of compensating said local reference 
signal by providing a broadband frequency spectrum comprised of a plurality of time ruler elements 

30 having different base frequencies. 

4. The method of claim 3 wherein said plurality of time ruler elements includes a first and second type, 
and wherein said first type includes a plurality of cycles of equal period and said second type includes 
one cycle of a different period so that the average pulse rate of said local reference signal is equal to 

35 the pulse rate of said incoming serial binary data whereby data recovery can be accurately undertaken 

even in instances of run length of thousands of unchanging value bits in said incoming serial binary 
data. 

5. The method of claim 2 wherein said step of providing digital command signals to shift the relative 
40 phase includes the step of delaying said local reference signal in a multistage delay line; and 

generating an end stage pointer (ESP) indicating which stage of said multistage delay line would 
provide exactly a 360 degree phase shift. 

6. The method of claim 5 wherein said step of providing digital command signals to shift the relative 
45 phase includes effectively connect in said ESP to said first stage of said delay line to wrap the stage 

selection for phase shift of said reference signal back to the first stage of said local delay line whenever 
the delay needed exceeds 360 degrees in phase. 

7. A method for adjusting the phase of a reference clock to track incoming received serial data including a 
so method for analyzing the edge position of binary transitions in said received serial data with respect to 

said reference clock and determining the average edge position of said received serial data, compris- 
ing: 

a. initiating the generation of a time ruler pulse train upon arrival of transitions of said received serial 
data by launching a pulse into, a serially connected plurality of unit delay elements, each said unit 

55 delay element having an input terminal, an intermediate terminal and an output terminal; 

b. connecting the input, intermediate and output terminate of said unit delays into a edge distribution 
analyzer circuit for sampling and analyzing said pulse train; 
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8. 

10 



15 



t C r' a n« a t mP ' in ? 6a w Ch inPUt ' intermediate and ou tPut terminal at the occurrence of a selected 
transition of said reference clock; ao.ci.ieu 

couToMh^di 1 " 16 ^"? 60 "" 9 ? 6 SamPl8d Si9nalS and S3id reference clock t0 P"** a running 
count of the difference between the number of receiver data transitions which are early (E) versus 
late (L) with respect to said selected transition of said reference clock- and 

nh a a l a '^ ,n9 H Said VGrSUS ' ate difference count t0 generate a correction signal for adjusting the 
phase of said reference clock to drive said early minus late (E-L) difference toward zero. 

The method of claim 7 wherein said combinatorial interconnections also provide a running count of the 
difference between the number of receiver data transitions which are earner than a first predeterm nS 

S^X^SS? «** " ^ 3 ^ 

-^el^^ - *- of said 

9 - 2^£"35. 7 S?^S? (l " 0) count corrects ,or 180 de9ree phase shifts and said E - L count 

*, 10 ' 1^ T eth0d r 0f ° laim 9 Wh8rein the minimum E " L count and the maximum l-O count are a measure of 
20 the phase alignment in said received serial data. measure ot 

se^ia!' btary'd^r C ° mPriSin9 m6anS SamP ' in9 *»* tranSlti ° n edQe P OSitions of inco ™° 

means for generating a broadband time ruler signal; 

means for adjustably shifting the phase of said broadband time ruler signal- 
means for comparing said shifted broadband time ruler signals to said' edge positions of said 
mcommg serial binary data and analyzing the results of said comparison, said means for adjustably 
shift.ng sa.d broadband time ruler being responsive to said analysed comparison. ^^ably 

so 12. The apparatus of claim 1 1 wherein said means for comparison provides a statistically averaoed 
time ruler to track said statistically averaged transition edge position. 

13. The apparatus of claim 11 wherein said means for generating a broadband time ruler includes- 

<~™ m HT < CaUSin9 S3id broadband time ruler *> have a spectrum containing at least a first and 

of said £3 f qUenCy ' T°I irSt S3id b3Se ,reqUenCy b6in9 the ffe ^ corresponding to the period 
of said local reference clock multiplied by N, where N is the number of pulses in the compensated 
broad band time ruler; and wherein the said second base frequency corresponds to to 

clocLnd ?' T\T T FR 3nd T2 = 1/FVFt ' Wh6re Fr equals the <* the receiver local 

clock and F T equals the frequency of the distant clock. 

14. The apparatus of claim 11 wherein said means for adjustably shifting said broadband time ruler 
includes a 360 degree phase shifter. uier 
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15. The apparatus of claim 14 where.n said 360 degree phase shifter includes a means to identify exactly 
at wh.ch output ,n a long chain of delay elements outputs that a signal in said delay linfis exactfv n 
phase with a signal which is propagating in said delay line; oe.ay une is exactly in 

means to mark said identified output in said chain of delay element as the end point- and 
means to cause the tap of the first element of said serially connected delay elements to orovide 

^ZlnT^L When t a,ar9er ^ th3n iS aVaNable at S3id 6nd »° int is -QuesTed whereTy S 
long chain of delay elements is wrapped back at the end point. 
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